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(54) Automated creation of application data paths in storage area networks 



(57) A DataPath Engine coupled to a SAN provides 
automated storage provisioning between an application 
on a Storage Area Network (SAN) attached server and 
a data volume on a SAN attached storage subsystem. 
The apparatus provides a simple user interface that al- 
lows operators to use pre-created policies for criteria to 



select data paths that meet organizations uptime and 
performance requirements. The apparatus uses pathing 
methodologies to select the optimal data path from the 
candidates by rating SAN state, uptime, performance, 
and other key factors. This apparatus allows an enter- 
prise to more efficiently and effectively manage and 
monitor large, complex, distributed SANs. 



Block Oiagram illustrating a system for performing Automated 
Provisioning within a SAN 
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Description 

BACKGROUND OF THE INVENTION 

[0001] The invention relates generally to the field of s 
managing storage in a networked storage environment 
and, in particular, relates to the systems and methods 
for provisioning storage and paths, including devices in 
the path, to storage in storage networks. 
[0002] Traditional computer storage architectures 
consist of dedicated storage devices connected to cor- 
responding computer servers, an example of which is 
illustrated in Fig. 1. However, the direct attachment of 
storage to corresponding servers does not support shar- 
ing of the storage infrastructure by multiple servers. 
Thus, this architecture has major limitations, such as, 
for example, capacity and performance constraints, dif- 
ficulty of management, reduced availability, and suscep- 
tibility to disasters. 

[0003] The Storage Area Network (commonly known 
as "SAN") is a relatively new storage architecture that 
provides any-to-any connectivity between servers and 
storage at high (e.g., gigabit) speeds, allowing an enter- 
prise to scale and manage its storage infrastructure in- 
dependently of its server resources. For purposes of this 
invention, SAN is defined to include a storage network 
connecting a distributed and heterogeneous collection 
of compute servers, storage infrastructure, and special 
purpose, storage-service appliances. Fig. 2 illustrates a 
conventional SAN architecture 200 having computer 
servers 202, a switching network 204 and a storage in- 
frastructure 206. Switching network 204 provides bi-di- 
rectional coupling between servers 202 and storage in- 
frastructure 206. Thus, SAN architecture 200 provides 
"any to any" connectivity between any servers 202 and 
a device of storage infrastructure 206, allowing hetero- 
geneous collections of servers to share the same stor- 
age infrastructure. Information technology (IT) manag- 
ers are implementing SAN technology to increase per- 
formance, consolidate and share storage resources, in- 
termix best of class storage components, manage their 
storage in a centralized manner, and gain the availability 
features. In particular, the SAN architecture supports 
powerful new data management concepts in support of 
backup, replication, and availability. For example, such 
"management" typically includes some or all of discov- 
ery, monitoring, topology creation, naming, persistence, 
data volume creation, switch security, data volume se- 
curity, and authentication. A data volume is a single log- 
ical unit exported by a storage subsystem to a computer 
server. It may consist of many physical discs, portions 
of discs, or a data tape, but looks like one contiguous 
space to a server. 

[0004] Unfortunately, conventional SAN manage- 
ment tools are typically server-based and have little or 
no inherent knowledge of the SAN special-purpose 
hardware and services such as replication services, 
snapshot engines, and virtualizers or SAN appliances. 



Conventional tools do not relate SAN storage and switch 
devices to applications and lines of business, and they 
don't automate the many processes required to allocate 
storage to applications. As a result, the actions required 
to be taken by IT management to put SAN data man- 
agement concepts into practice are very complex, and 
are becoming even more so as SAN infrastructures be- 
come more complex. 

[0005] Today, most enterprises create data paths us- 
ing a collection of device specific tools while referring to 
spreadsheets for SAN device information. Some enter- 
prises have written scripts to address the complexity of 
storage provisioning in a SAN. These scripts have to 
modified whenever new equipment is added to the SAN 
and executed by hand. In many cases they have select- 
ed one security mechanism rather than using all of the 
mechanisms to reduce complexity, in some cases, cor- 
porate databases have been created to maintain the 
settings of all the devices in the SAN although many or- 
ganizations still use paper documents to keep track of 
current settings. 

[0006] The conventional server-based SAN manage- 
ment tools typically provide only device-specific task 
management capability. Fig. 3 illustrates an example of 
the steps a SAN administrator would have to execute in 
order to provide a SAN attached server storage access 
using current un-automated, device-based SAN man- 
agement tools. 

[0007] At step 251 , the SAN administrator physically 
verifies that the server is connected to the disc storage. 
At step 252, the SAN administrator finds a data volume 
of sufficient size. At step 253, the SAN administrator 
sets mapping of the selected data volume. At step 254, 
the SAN administrator selects one or more physical con- 
nections from the server to the storage subsystem with 
the data volume. At step 255, the SAN administrator ver- 
ifies that the physical connection is good. At step 256, 
the SAN administrator configures the switch by setting 
security. Step 255 and 256 are repeated by the SAN ad- 
ministrator for every switch used. Finally, at step 258, 
the SAN administrator configures the host bus adaptor 
(HBAs), within the servers. 

[0008] Thus, as can be seen from the example of Fig. 
3, the SAN administrator that administers a convention- 
al SAN has a lot of storage infrastructure knowledge and 
uses many different tools in order to effectively manage 
the SAN. 

SUMMARY OF THE INVENTION 

[0009] The present invention provides an efficient so- 
lution to the problem of implementation and manage- 
ment of a SAN. Preferred embodiments of the invention 
provide for both an apparatus and a method for auto- 
matic provisioning of storage to servers within a SAN. 
An operator, rather than a highly trained storage and 
switching expert, is able to perform automated provi- 
sioning which results in the creation of a data path be- 
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tween a server and data. A preferred embodiment of the 
present invention discovers and saves details of the 
SAN architecture; including, for example, server config- 
urations, processes executable on specific servers and 
association of the processes with the server, devices 
and configurations of the switching network, and devic- 
es and configurations of the storage architecture. De- 
vices, as used herein, are defined to include, for exam- 
ple, disc storage subsystems, tape storage subsystems, 
storage switches, storage routers, SAN appliances, and 
other storage devices such as solid state disc, etc. 
[0010] Not only is static information determined, but 
dynamic information and state information as well. In the 
preferred embodiment, a DataPath Engine is provided 
which initiates, controls and monitors the discovering, 
saving, using, configuring, recommending and reporting 
associated with the preferred embodiment. The DataP- 
atn Engine calculates the optimal data path based upon 
Iho rules or policies specified and information learned 
about the SAN, including policies and rules defined in 
prcconfigured templates for interaction with the DataP- 
atn Engine. As used herein, the term template is defined 
to include, for example, a list of defined rules and poli- 
cies which define the storage characteristics and data 
path characteristics that must be used by the DataPath 
Engine for selection of a data path. The template is cre- 
ated in advance by an administrator using a graphical 
wizard, for example. 

[001 1 ] A preferred embodiment of the present inven- 
tion is directed towards a method of creating a data path 
fo r a process executing on a server coupled to a storage 
area network (SAN). The method includes parameteriz- 
ing a set of attributes for a desired data path between 
the process and a device of the SAN; and constructing 
the data path that provides the set of attributes. For pur- 
poses of this application, the term attributes includes de- 
ta Is about data volumes, security settings, performance 
scttirgs. and other device and policy settings., and pa- 
rameterizing is defined to include defaults selected by 
the system to help the administrator make better choic- 
es wicn creating a template which reflects data path 
policy and rules., with parameterizing attributes refer- 
ring to an abstraction of the configuration, implementa- 
tion and creation steps to identify the desired end prod- 
uct without necessarily specifying implementation de- 
tails. 

[0012] An alternate preferred embodiment of the 
present invention provides for a method of configuring 
a SAN. The method includes discovering, by use of a 
DataPath Engine coupled to the SAN, processes that 
arc operable on a server coupled to the SAN; discover- 
ing, by use of the Data Path Engine coupled to the SAN, 
devices that are included in the SAN ; responding, by use 
of the DataPath Engine coupled to the SAN, to a data 
path construction request from a user by providing the 
user with an interface to accept a set of attributes for a 
desired data path for one of the discovered processes; 
and constructing, by use of the DataPath Engine cou- 



pled to the SAN, the data path that provides the set of 
attributes. 

[0013] In one aspect, the method of a preferred em- 
bodiment includes discovery of SAN device details; ac- 

5 cepting policy and input regarding the type and size of 
data volume and path desired; and finding candidate da- 
ta paths and volumes that meet the policy. The term pol- 
icy and rules are important parts of the preferred em- 
bodiment of the present invention. Policy is defined to 

10 include actions which the DataPath Engine will take 
based upon events in the SAN and the term rules is de- 
fined to include characteristics of data volumes and data 
paths that the DataPath Engine must use to select can- 
didate data paths for the application. 

15 [0014] The data path may contain multiple channels 
or threads. A thread is a logical relationship representing 
a physical path between the server on which the appli- 
cation is resident and all of the devices, connections, 
ports and security settings in between. Further, for pur- 

20 poses of this application, threads are defined to include 
one or more of, depending upon the needs of the em- 
bodiment, application id, server id, HBA port id, HBA id, 
HBA security settings, switch port ids, switch security 
settings, storage subsystem port id, data volume id, data 

25 volume security settings, SAN appliance port id, SAN 
appliance settings. These relationships include, but are 
not limited to, the data volume; the storage subsystem 
the volume resides on; all ports and connections; 
switches; and SAN appliances and other hardware in 

30 the data path; the server with the Host Bus Adapter 
(HBA) where the application resides; and all applicable 
device settings. The data path selection is based upon 
policies such as, number of threads : number of separate 
storage switch fabrics that the threads must go through, 

35 level of security desired and actions to take based upon 
security problems detected, performance characteris- 
tics and cost characteristics desired. Data paths are cre- 
ated from SAN devices automatically discovered by the 
DataPath Engine (Applications, Servers, HBAs, Switch- 

40 es, Fabrics, Storage Subsystems, Routers, Data Vol- 
umes, Tape drives, Connections, Data Volume security, 
etc.). The data path can have multiple threads to the 
same data volume and span physical locations and mul- 
tiple switched fabrics. 

45 [0015] In another aspect, a preferred embodiment of 
the present invention is a method including an appara- 
tus for selection and creation of the optimal data path 
among the candidate data paths. Pathing methodolo- 
gies within the DataPath Engine use discovered infor- 

50 mation about the SAN such as device uptime informa- 
tion, performance information, cost information, and 
load. Device uptime information is defined to include, for 
example, the collection and persistence of data about 
when a device or connection or service is in service and 

55 available versus unavailable. Performance information 
is defined to include the collection and persistence of 
data about how each device is moving data from its lo- 
cation to the next and the resulting rate of data that the 
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application is experiencing. Cost information is defined 
to include the correlation of cost information with actual 
devices used within a data path taking into consideration 
the percentage of the device used by the data path, 
when multiple data paths share the same devices. Cost 5 
in this case is a calculation of the infrastructure used. 
Many customers purchase expensive hardware in 
hopes of achieving high levels of availability. A calcula- 
tion of the cost (switch ports used, HBA ports used, # 
gigabytes used) for each path may be provided for cus- 
tomers to analyze their cost for performance and avail- 
ability achieved, as well as used as a "rule" when cre- 
ating data paths. 

[0016] Best practices information is also factored in 
as appropriate for the discovered devices, such as fan 
out ratios and switch fabric architecture which impacts 
performance once a new data path is added to the in- 
frastructure. 

[0017] Implementations of preferred embodiments of 
the invention provide for one or more of the following: 
[0018] An operator with no storage or switch training 
uses a wizard on a graphical viewer to provision storage. 
Provisioning storage is defined to include creating a da- 
ta path for a software application on a SAN attached 
server to a new or existing data volume. 
[001 9] Data path creation rules and policies are spec- 
ified in a pre-created template. The template is expand- 
able to include new rules as they are defined. A policy 
or template is created by a SAN administrator to meet 
the availability and performance needs of a software ap- 
plication on a SAN attached server. At storage provi- 
sioning time the proper template is automatically select- 
ed for the application. In some cases, predefined tem- 
plates are provided and do not require creation by the 
SAN administrator. In some instances, for example, this 
is appropriate for standard implementations of common 
processes/applications. 

[0020] Data path thread selection logic uses pathing 
methodologies that take into consideration the learned 
state and usage of the SAN. 

[0021 ] Once the selected data path is approved by the 
operator, the DataPath Engine automatically configures 
SAN devices for data path creation across multiple de- 
vices, networks and locations. 
[0022] Implementations of automated storage provi- 
sioning include but are not limited to, creation of data 
paths for an application, discovery of pre-existing data 
paths, reconfiguration of data paths, movement of data 
paths between asynchronous replications, and tuning of 
data paths based upon data collected about the SAN's 
performance and uptime. Advantages of the invention 
include the ability for a small number of operators to 
manage large, complex and distributed storage net- 
works. They do not require detailed knowledge of stor- 
age, networking devices or extensive training. Manual 
procedures and policies are automated for a huge time 
gain and reduction of personnel. Pathing methodologies 
calculate the best data paths rather than relying on ex- 



perts or operator memory to select the optimal path dur- 
ing setup. Complex storage networking hardware and 
services can be added to storage networks and quickly 
incorporated into new or existing data paths. 
[0023] Incorporating new devices into new or existing 
data paths is preferably done by adding new "rules" to 
the template. For example, a special purpose storage 
replication system is added to the SAN. A rule would be 
added to the DataPath Engine that allows a template to 
be created which selects a data path with the specified 
replication systems. New paths would be created using 
this template. Existing paths would be changed by data 
re-pathing. Re-pathing would allow an operatorto select 
an alternate path to the existing volume or a replica of 
the volume (in another location) using a selected tem- 
plate. This capability, then, will support switching be- 
tween replications as well as incorporating new devices 
into existing paths by changing to a new template (policy 
and rules). This also supports a change in performance 
characteristics and path optimization via a template with 
different settings than the original one used. 
[0024] In an alternate preferred embodiment, the Da- 
taPath Engine stores the parameterization data or ac- 
cesses a store of parameterization data used in the 
specification of existing data paths (including policies/ 
templates/rules) used in guiding the generation of each 
existing data path. Periodically (automatically or opera- 
tor initiated), the DataPath Engine reruns the pathing 
methodologies based upon the stored parameters to de- 
termine whether a new optimal data path exists. De- 
pending upon specific embodiments, the data path may 
be changed automatically or the user may be requested 
to authorize the use of the new data path, 
[0025] Automated storage provisioning is a powerful 
system for enterprises with extensive storage networks 
to reduce their personnel requirements and better utilize 
theirstorage infrastructure. As used herein, the term au- 
tomatic means that all the underlying SAN infrastructure 
and settings are configured by the DataPath Engine 
without administrator intervention based solely on a re- 
quest specifying an application, data volume size and 
template.. The above description refers to the construc- 
tion of a data path. 

[0026] Further understanding of the nature and ad- 
vantages of the invention may be realized by reference 
to the remaining portions of the Specification and Draw- 
ings. In the drawings, similarly numbered items repre- 
sent the same or functionally equivalent structures. 



[0027] The invention will be described with respect to 
particular embodiments thereof, and reference will be 
made to the drawings in which: 

Fig. 1 illustrates the "direct attach" storage architec- 
ture of the prior art; 

Fig. 2 illustrates the Storage Area Network (SAN) 
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architecture of today; 

Fig. 3 illustrates the steps, which a SAN administra- 
tor must perform for the prior art of manually provi- 
sioning storage within the SAN architecture shown 
in Fig. 2; 

Fig. 4 is a block diagram illustrating a preferred em- 
bodiment for a system that performs automated 
storage provisioning; 

Fig. 5 illustrates the steps the DataPath Engine and 
an operator to perform automated storage provi- 
sioning within the SAN architecture shown in Fig. 4; 
Fig. 6 illustrates rules or policies that can be spec- 
ified by a SAN administrator which the DataPath 
Engine uses to select data volumes and data paths 
for an application; 

Fig. 7 is a layout showing connections for Fig. 7A 
and 7B which together illustrate a flowchart of a 
pathing methodology used by DataPath Engine to 
select the optimal data path through a complex SAN 
infrastructure; 

Fig. 8 is an illustration of a graphical user interface 
800 for, in the preferred embodiment, provisioning 
a data path; 

Fig. 9 is an illustration of interface 800 after the op- 
erator has selected 'Next' from interface 800 con- 
figuration illustrated in Fig. 8; 
Fig. 10 is an illustration of interface 800 after the 
operator has selected 'Next* from interface 800 con- 
figuration illustrated in Fig. 9; and 
Fig. 11 is illustrates an alternate preferred embodi- 
ment for creation of a data path. In this case, a data 
path is desired to an existing data volume, already 
being used by another server. 

DESCRIPTION OF THE SPECIFIC EMBODIMENTS 

[0028] Fig. 4 illustrates an embodiment in accordance 
with the invention in which DataPath Engine 302 is pro- 
vided as part of (or closely coupled to, depending upon 
a particular embodiment) the SAN infrastructure; 
switching network 204, SAN attached servers 202, and 
storage infrastructure 206. DataPath Engine 302 oper- 
ates in response to high-level SAN management com- 
mands entered by the operator via a Management Sta- 
tion 304 using a graphical user interface. The DataPath 
Engine is preferably operated as part of a general pur- 
pose computer server running any of several types of 
operating systems as well-known art; with one or more 
processing units; connected to the Wide Area Network 
(WAN) and switching network 204; and having attached 
a persistent repository 306. The DataPath Engine exe- 
cutes best with world-wide connectivity to SAN infra- 
structure via WAN and fibre channel connections and 
can be clustered with multiple computer servers for high 
availability and work load balancing. The management 
station is preferably operated as part of a specially pro- 
grammed general purpose computer, such as a PC, 
workstation, mini-computer or other computing platform 



running any of several types of operating systems as 
well-known in the art. As well known, these platforms 
include one or more processing units for executing in- 
structions obtained from memory (volatile or nonvola- 
5 tile) and typically under control from signals provided 
through an I/O system. A display adapter typically maps 
memory contents of page frames for display on a mon- 
itor. Instructions and data for the executing instructions 
(program) and other data may be in the memory sys- 
10 terns, which also includes removable media such as a 
portable magnetic media (e.g., floppy disk). The 
processing unit is able to communicate with other 
processing systems (e.g. the servers in the server ar- 
chitecture, devices in the switching/routing architecture, 
*5 or devices in the storage architecture, typically by em- 
ploying conventional networking protocols such as TCP/ 
IP for example. These systems are well known to per- 
sons of ordinary skill in the art and will not be further 
explained herein. 

[0029] DataPath Engine 302 is coupled to switch net- 
work 204 and WAN 208 and obtains knowledge of the 
identity and behavior of the specific devices in storage 
infrastructure 206 and applications on the SAN attached 
servers 202. This information is embedded, incorporat- 
ed or otherwise associated within DataPath Engine 302 , 
saved to a persistent repository, and obtained either by . 
automated discovery or through manual configuration. 
Automated discovery includes polling and broadcasting 
across the WAN and switch network for devices to ini- 
tially find devices; to find new devices added to the SAN 
infrastructure, and to continually collect status on the de- 
vices. The configuration and use of DataPath Engine 
302 allows device management specifics to be hidden 
from the operator as it handles all the details of individual 
device management. 

[0030] Fig. 5 illustrates procedures for automated 
storage provisioning by an operator using DataPath En- 
gine 302. (This is in contrast to the discussion above 
concerning Fig. 3, which illustrates path allocation using 
conventional SAN management.) At step 351 , DataPath 
Engine 302 automatically discovers SAN device config- 
uration and status. At step 352, the operator selects a 
create data path command from a graphical interface 
along with policy. The embodiment of the command type 
is "Create a Virtual Private DataPath". A "Virtual* Private 
DataPath (VPD)" is the term representing a data path 
coupled with a data volume meeting the template rules, 
and monitored constantly in order to implement policies 
specified. At step 353, DataPath Engine 302 selects the 
best candidate data volumes that meets the policy cri- 
teria. At step 354 the optimal candidate data paths are 
displayed for operator selection and the data path is set 
up based upon the operator's action. A data path repre- 
sents the number of threads created between the appli- 
cation server and the data volume. 
[0031] In this preferred embodiment, the pathing 
methodologies prioritize certain selection requirements 
over other possible candidates. In other embodiments, 
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different selection requirements may produce different 
optimal paths. Optimal refers to a best fit of available 
resources to parameterized attributes based upon ap- 
plicable prioritization conditions. In other embodiments 
and under different conditions, the recommended or op- 
timal data path could be different, so best is used is a 
relative sense as opposed to an absolute sense. 
[0032] Thus, it can be seen that an operator using the 
system illustrated in Fig. 4 can manage the SAN without 
having detailed knowledge of the SAN devices and 
quickly provision storage with a two-step command. 
[0033] Fig. 6 illustrates rules that can be implemented 
as policy for data path selection. The policy includes, 
but is not limited to: Storage Type, 451, Number of 
Threads or Channels, 452, Number of Fabrics, 453, 
Bandwidth, 454, Data Volume Security, 455, Data Vol- 
ume Action, 456, Fabric Security, 457, Fabric Action, 
458. HBA Security, 459 : HBA Action, 460, Data Volume 
Si/e. 461, DataVolume Pooling, 462, Data Volume 
Characteristics, 463 : and Data Path Status, 464. 
[0034] Fig. 7 illustrates a pathing methodology proc- 
ess 550 setting forth a preferred embodiment of how Da- 
taPath Engine 302 in Fig. 4 selects an optimal path. 
Process 550 first reads a data path policy 551 , obtains 
a current SAN state 552, gets an application name and 
volume size desired from an operator 553, and initiates 
a search for data paths from each of the server's HBA 
ports 554. 

[0035] The data path search 554 loops through each 
server HBA port 555, using the following steps: checks 
whether a port is used by any other data path 566; when 
already used, then gives the port a weighting 557, 558; 
when not used, checks to see whether used by another 
trread of this data path 559; and when not then gives 
the port the best ranking 560; and adds the port to can- 
didate list 561 . 

[0036] Once all candidate ports have been identified, 
DataPath Engine 302 determines whether the number 
of HBA ports is equal to or greater than the number of 
threads specified in the policy 562, and when not, it fails 
(Step 563). Otherwise it searches for storage subsys- 
tems that match the policy requirements 564. 
[0037] The data volume search loops through each 
storage subsystem 565, and determines whether there 
arc suitable data volumes 566. The DataPath Engine 
302 loops through each data volume 567, and when the 
si/c is acceptable 568 : and the data volume is accessi- 
ble 569. it gives the data volume a ranking 571 . When 
no data volumes are found 572, then it fails 573. All data 
volumes inaccessible from the server HBA are then dis- 
carded 574. 

[0038] Next DataPath Engine 302 ranks each switch 
fabric 575, and computes the shortest data path 576. 
For each HBA port on the server 577 ; it finds the shortest 
route through the fabric to the data volume 578, and 
ranks the data path 579. Finally, DataPath Engine 302 
calculates and sorts data paths by their ranking 580, 
581. 582. 



[0039] Fig. 8 is an illustration of a graphical user in- 
terface 800 for, in the preferred embodiment, parame- 
terizing a set of attributes for a desired data path. Inter- 
face 800 is coupled to DataPath Engine 302 and may 
s be part of management station 304. The preferred em- 
bodiment uses a simple set of data entry screens 800 
and those shown in Figs. 9-11 , to guide an operator in 
the definition and creation of an optimal data path (the 
path from the application on the server to the data vol- 
10 ume on a storage subsystem). Interface 800 includes 
data based upon the context from which the operator 
requested to create the optimal data path. For example, 
when the operator has the owner view for 'Trading 
Floor" currently opened, the owner information of inter- 
ns face 800 contains "Trading Floor." The operator is able 
to override any field. In some cases, interface 800 is pre- 
populated (or preloaded) with policy information or tem- 
plate information. 

[0040] Fig. 8 through Fig. 1 0 illustrate a data path cre- 

20 ation scenario for a preferred embodiment. Fig. 8 de- 
scribes an interface 800 that is populated with informa- 
tion to serve as an example. Interface 800 includes a 
data path template "GoldService" identifying and spec- 
ifying rules about data path creation that hides the com- 

25 piexity and details of the storage and switching network 
architectures from the operator. The template includes 
information such as the number of threads, the security 
levels, default data volume size, etc. The provisioning 
can be done on behalf of an application executing on 

30 one server or a cluster of servers. In this example the 
operator is creating a data path from the application, Or- 
acle, which is executing on a cluster consisting of three 
servers (DBserverl, DBserver2, DBserver3). The size 
requested is 72GB. The owner of the data path is the 

55 Trading Floor. Included in interface 800 is the concept 
of data path owner, which for purposes of this applica- 
tion, is defined to include a name of a customer or line 
of business within an enterprise and information about 
that organization such as a contact person, their email 

^0 address for automatic notification, their address, appli- 
cations which are associated with the owner and default 
templates that are associated with the owner. 
[0041] Fig. 9 is an illustration of interface 800 after the 
operator has selected 'Next* from interface 800 config- 

45 uration illustrated in Fig. 8, if "automatically select best 
VPD" is not checked. After the operator selects Next, 
the DataPath Engine 302 searches the SAN and switch- 
ing infrastructure for candidate data volumes and the 
best data paths from each server to candidate data vol- 

50 umes. The best data path candidates will be selected 
using the pathing methodologies specified in Fig. 7. Fig. 
9 illustrates that interface 800 now displays a set of data 
path candidate sets, including best data path candidate 
(determined as the candidate data path matching the 

55 rules and policies most closely) to each server, and al- 
ternates. The operator selects an alternate set or con- 
tinues to the Next screen which automatically selects 
the default best candidate data paths. 
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[0042] Fig. 1 0 an illustration of interface 800 after the 
operator has selected 'Next' from interface 800 config- 
uration illustrated in Fig. 9. Once the operator selects 
Next from the previous interface 800 configuration, in- 
terface 800 provides the operator with an opportunity to 
name the data path and to provide a description. After 
the DataPath Engine 302 creates the identified data 
path, the DataPath Engine 302 notifies the operator. 
[0043] Fig. 11 illustrates an alternate preferred em- 
bodiment for creation of a data path. In this case, a data 
path is desired to an existing data volume, already being 
used by another server. This is a common case with 
clusters of servers that must all access the same data 
for a shared application. In this case interface 800 is 
configured as shown in Fig. 11 and is used in place of 
interface 800 illustrated in Fig. 8. Other than the data 
volume preexist, the subsequent steps in the data path 
creation process do not have to be changed. 
[0044] It should be noted that the embodiments de- 
scribed here may be implemented in hardware, soft- 
ware, firmware or some combination thereof. While par- 
ticular embodiments have been described, the scope of 
the invention is not to be limited to any particular em- 
bodiment. Rather, the scope of the invention is to be de- 
termined from the claims. 



Claims 

1 . A method of creating a data path for a process ex- 
ecuting on a server coupled to a storage area net- 
work (SAN), comprising: 

parameterizing a set of attributes for a desired 
data path between the process and a device of 
the SAN; and 

constructing the data path that provides said 
set of attributes. 

2. The method of claim 1 wherein said set of attributes 
includes a pre-defined template. 

3. The method of claim 2 wherein said set of attributes 
includes a data path owner, application, and the 
server or servers on which the application is exe- 
cuting. 

4. The method of claim 2 wherein said pre-defined 
template specifies a set of performance, availability, 
and cost metrics for the desired data path. 

5. The method of claim 4 wherein said set of perform- 
ance and availability metrics includes at least one 
of a number of threads, a security level, and a de- 
fault volume size and characteristics, default path 
characteristics. 

6. The method of claim 1 wherein said parameterizing 



step includes a step of entering a user-defined at- 
tribute for inclusion in said set of attributes. 

7. The method of claim 6 wherein said entering step 
5 includes entry of said user-defined attribute by use 

of a graphical user interface coupled to the SAN. 

8. The method of claim 1 wherein said constructing 
step further comprises: 

10 

searching the SAN for a set of candidate devic- 
es; 

constructing a candidate data path from the 
server to each candidate device of said set of 

15 candidate devices; 

evaluating each said candidate data path 
against a selection metric to rank said candi- 
date data paths from a best candidate data path 
to a least best candidate data path according 

20 to said selection metric; and 

selecting said best candidate data path as the 
data path to be constructed by said construct- 
ing step. 

25 9. The method of claim 1 wherein said constructing 
step further comprises: 

searching the SAN for a set of candidate devic- 
es; 

30 constructing a candidate data path from the 

server to each candidate device of said set of 
candidate devices; 

evaluating each said candidate data path 
against a selection metric to rank said candi- 

35 date data paths from a best candidate data path 

to a least best candidate data path according 
to said selection metric; 
presenting said ranked candidate data paths to 
a user for selection; and 

40 selecting a user-selected candidate data path 

as the data path to be constructed by said con- 
structing step. 

10. The method of claim 9 wherein said presenting step 
45 recommends said best candidate data path for se- 
lection by said user. 

1 1 . The method of claim 1 0 wherein said best candidate 
data path is presented as a default selection at said 

50 selecting step. 

1 2. The method of claim 9 wherein said selection metric 
includes device uptime information. 

55 13. The method of claim 9 wherein said selection metric 
includes performance information. 

1 4. The method of claim 9 wherein said selection metric 
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Includes cost calculation. 

1 5. The method of claim 9 wherein said selection metric 
includes best SAN practices information. 

5 

16. The method of claim 9 wherein said selection metric 
includes learned state and usage information of the 
SAN. 

17. The method of claim 9 wherein said searching step 10 
prequalifies a subset of candidate data paths by 
finding those candidates that satisfy a pre-created 
policy prior to application of said evaluating step. 

1 8. The method of claim 1 wherein said constructed da- *5 
ta path includes all physical, logical and security 
component identification and configuration inf orma- 
tion sufficient to operably link the process to an 
identified data volume of the SAN. 

20 

19. A method of configuring a SAN, comprising: 

discovering, by use of a data path engine cou- 
pled to the SAN, processes that are operable 
on a server coupled to the SAN; 25 
discovering, by use of said data path engine 
coupled to the SAN, devices that are included 
in the SAN; 

responding, by use of said data path engine 
coupled to the SAN , to a data path construction 30 
request from a user by providing said user with 
an interface to accept a set of attributes for a 
desired data path for one of said discovered 
processes; and 

constructing, by use of the DataPath Engine 35 
coupled to the SAN, the data path that provides 
said set of attributes. 

20. Apparatus forcreating a data path for a process ex- 
ecuting on a server coupled to a storage area net- *o 
work (SAN), comprising: 

means for parameterizing a set of attributes for 
a desired data path between the process and 
a device of the SAN; and «5 
means, coupled to said parameterizing means, 
forconstructing the data path that provides said 
set of attributes. 

so 
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Steps to manually provision storage for a server (Prior Art); 

STEP 251 

A SAN administrator physically verifies that the server is 
connected to the storage subsystem by checking the fibre channel 
cabling from the server host bus adapter (HBA) to the fibre 
channel switch(es) and on to the target disk subsystem. 

STEP 252 

A SAN administrator uses a storage subsystem tool (or SAN 
management tool) to find a data volume of sufficient size. 

STEP 253 

A SAN administrator uses a storage subsystem tool to set 
mapping of the selected data volume to the desired host. 

STEP 254 

A SAN administrator makes a best guess as to the usage of 
various switched fabrics and paths and selects one or more paths 
from the server to the data volume. 

STEP 255 

A SAN administrator uses the SAN switch tool to verify that the 
physical connection is good by checking the name server data to 
see that the server (HBA) has successfully logged into the switch. 

STEP 256 . 

A SAN administrator uses the SAN switch tool to configure the 
fabric by setting the switch zones between the storage port and 
server HBA port 

STEP 257 

Step 255 and 256 are repeated by the SAN administrator for every 
fabric used between the server and the data volume. 

STEP 258 

The SAN administrator uses the HBA interface tool to configure 
the HBA's on the server to set masking and mapping for the data 
volume that the server needs to access. 



FIG. 3 
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Block Diagram illustrating a system for performing Automated 
Provisioning within a SAN 




WAN 208 



FIG. 4 
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Steps for Automated Provisioning of Data Paths: 

STEP 351 

DataPath Engine software logic discovers all physical devices 
including switches, storage subsystems, servers, HBAs, data 
volumes and SAN appliances; all connections through the SAN 
including switch connections, server HBA connections and storage 
subsystem connections; ail settings including switch zones, data 
volume maps, security settings, and SAN state; and displays the 
graphical topology of the SAN. 

STEP 352 

An Operator selects a create data path command from a graphical 
interface. The graphical interface provides a wizard that guides the 
operator through the data path creation and requests the operator 
to select an application and size of data volume (or existing data 
volume). Selection rules and policy is pre-created for each 
application. 

STEP 353 

DataPath Engine 302 selects the best data volume that meets the 
policy criteria. Best selection pathing methodology takes into 
consideration the state and activity of the SAN. DataPath Engine 
302 discovers and selects physical paths through the SAN fabric 
from the selected data volume to the target application host that 
meet the criteria. Best selection pathing methodologies select the 
best path based upon the state and usage of the SAN. 

STEP 354 

The selected data paths are displayed for the operator. If the 
operator accepts the selection, DataPath Engine 302 configures 
settings on the HBA, storage device, data volume, and fabric 
switches to maximize security and allows accessibility to the data 
volume. The path settings and configuration data are saved. 



FIG. 5 
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Example Rules that can be used for Data Path Selection criteria 



EXAMPLE 
RULES 


DESCRIPTION 


EXAMPLE 
RULES 


451: Storage Type 


The end point storage type 


Disk, Tape SSD, etc. 


452; Number of 
Threads or 
Channels 


Sets the data path rule to the nuber different physical 
connections between server and data Volume that 
must be selected. Each Thread requires a different 
HBA port on the Application Server. 


1-10 


453: Number of 
Fabrics or WANs 


Sets the rule to the number of different switch fabrics 
to use for selection of each Thread above. Each fabric 
requires a different HBA port in the Application Server, 


1-10 


454: Bandwidth 


Sets the rule as to whether each Thread selected must 
be exclusive or shared. Exclusive Threads mean that 
no other Thread may share any portion of the physical 
connection and ports used. 


Shared or 
Exclush/e 


455: Dat3 Volume 
Security 


Sets the rule as to whether to select a storage 
subsystem where Data Volume mapping is used or 
not Open does not use storage subsystems with 
Volume Mapping. 


Secure or Open 


456: Data Volume 
Action 


Sets the policy for action it a Data Volume map is 
modified by alternate SAN tools: 
Ignore (default for Open), 
Notify only, 

Notify and disable by hiding the Data Volume from server, 
or Notify and override (change it back to original settings) 


Ignore, Notify, 
Disable.Override 


457: Fabric Security 


Sets the rule as to whether to select a switch with 
zoning enabled or not. Open does not use switches 
with zones. 


Secure, Notify or 
Open 


458: Fabric Action 


Sets the policy for action if a switch zone is modified 
by alternate SAN tools: 
Ignore (default fcr Open), 
Notify only, 

Notify and disable by hiding Volume from server, or 
Notify and override (change it back to original settings) 


Ignore, Notify, 
Disable.Override 


459: HBA Security 


Sets the rule as to whether HBA masking or mapping 
is used or not. Ooon doos not use HBA security 
settings. 


Open, Secure 


460: HBA Action 


Sets the policy fur action if a HBA setting is modified 
by non-lnierSAN software, (same as Fabric Action) 


Ignore, Notify, 
Disable.Override 


461: Data Volume 
Size 


Number of Gigabytes. The closest Volume size will bo 
discovered. 


1-99999 


462: Volume Pooling 


Use more than one volume to meet size requirements 


Yes/no 


463: Data Volume 
Char 


Type of RAID characteristics of the data volume 


RAID 0.1,3.5, etc. 


464: Data Path Stat 


Status of data path: Enabled or Disabled 


Enable or Disable 



FIG. 6 
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Figure 7A 
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Figure 7B 



FIG. 7 



EP 1 271 296 A2 



Pathing Methodology for optimal data path selection (Process 550) 



551 



552- 




553 



21 



Get Application 
name, Server name 
Volume Size 



554^ | 



Search each HBA port 



Yes 




HBA port score = 100- 
(tf of SPs using port * 
SP use discount) 



559 



HBA port score = 100 



HBA port score scaled 
by port bandwidlh / 
average data rate 



561 



563 



2l 



J 




Yes 



Add port/score to 
possible HBA port list 



562 




Find Storage 
Subsystems 
matching Storage 
Path template 



Yes 



569-^ Y | s 



Find set of ports on Storage 
Subsystem through which the 
volume can be reached and score 
them as was done with HBA ports. 




Volume score - dosesness to 
requested size scaled by score of 
the ports. Add Volume/Score 
to possible volume fist. 



573 



572- 




Find intersection of Fabrics reachable 
from HBA ports and Fabrics reachable 
from Volume ports. Discard any HBA 
ports/Volumes where the intersection 
is null. 



1 

To next page 
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Pathing Methodology for optimal data path selection (continued) 

675 ^ 



Score each fabric based on the # 
of data p3lhs using the fabric and 
the used bandwidth of the Fabric. 



576^ 



Compute the shortest possible 
Route through the fabric for each 
HBA port. Fabric, Volume port 
combination. 



More HBA ^\ 
Ports? J>" 



Yes 

576S i 



579-^ 



Find shortest route 
through fabric to a 
Volume port. 



T 



No 



Route score = 100 — (7* # cf 
switches in route. 
This score is scaled by the score of 
the Fabric thus short routes in the 
best Fabrics yields the highest 
scores. Add Route to list. 



Sort HBA ports. Routes and 
Volumes by score. 



581- 



I 



Assemble Potential Data Paths 
using the best HBA ports, Routes 
and Volumes, Each Potential Data 
Path is assigned a weight score 
derived from the scores of its 
components. 



582- 



Sort set of Potential Datapaths 
by score. 



FIG. 7B 
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Now Virtual Private DHlapall 1 



Owner: 
Application: 

Seryef(s); 




Help | 



| Trading Floor 



| Oracle 



VPD Template: 



DB Server 1 
DBServer 2 
DBServer 3 



Gold Service 



Volume Size (Gb): [72 [ 
0 Automatically select best VPD 



1 | Select... 1 
~l l Select... 1 



H Properties] 



| Cancel | <Back Next > } \ Rnish ~] 



FIG. 8 



New Virtual Private Datapath: Step 2 of 3 



ghoose a VPD Set meeting your criteria 



Devices used by afl potential VPDs: 



~9VPDset 1 
9VPD 1 

l — ^thread J 

r- ~- MBA 

t p 0f t Number 2 

i — a Fabric 1 
r-a Switch 

i } Port Number 1 

i x Port Number 2 

^-©-Switch 
©Fabric 2 
l --^Disk 1 

} Volume 1 

r Contoilor 1 

i p ort jsj urT1Der -j 

■-■ffl thread 2 
fflVPD2 
OVPDset2 



Hide Topology 




I Help I I Cancel [pBack if^ext > [ | Finish ) 
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|| New Virtual Private Datapath: Step 3 of 3 I 










VPD Set Name: | j 

Status: ©Enabled O Disabled 
Description: 






Help Cancel | <Back |[ Mext> f finish | 





FIG. 10 



New Virtual Priv?it+:j Datapath from Existing Volume: Step 1 of 3 




Volume: 
Inters AN: 
Owner: 

Application: 

Se£ver(s): 



Volume Name 
InterSAN Name 
[Trading Floor 



| Oracle 



DBServer 1 
DBServer 2 
DBServer 3 



0 Automatically select best VPD 



1 1 Select... ] 
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VPD Template: |GoldServlce [g| [Properties 



Cancel 1 1 <Back | | Next> [ 1 finish | 
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